Proposed Approach For Web Page Access Prediction Using Populartiy And Similarity Based Page Rank Algorithm
نویسنده
چکیده
Nowadays, the Web is an important source of information retrieval, and the users accessing the Web are from different backgrounds. The usage information about users are recorded in web logs. Analyzing web log files to extract useful patterns is called Web Usage Mining. Web usage mining approaches include clustering, association rule mining, sequential pattern mining etc. The web usage mining appr oaches can be applied to predict next page access. In this paper, we proposed a Page Rank-like algorithm is proposed for conducting web page access prediction. We extend the use of page rank algorithm for next page prediction with several navigational attributes, which are the similar ity of the page, size of the page, access-time of the page, duration of the page and transition(two pages visits sequentially) and frequency of page and transition.
منابع مشابه
Proposed Approach For Web Page Access Prediction Using Popularity And Similarity Based Page Rank Algorithm
Nowadays, the Web is an important source of information retrieval, and the users accessing the Web are from different backgrounds. The usage information about users are recorded in web logs. Analyzing web log files to extract useful patterns is called Web Usage Mining. Web usage mining approaches include clustering, association rule mining, sequential pattern mining etc. The web usage mining ap...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کاملA New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...
متن کاملتشخیص ناهنجاری روی وب از طریق ایجاد پروفایل کاربرد دسترسی
Due to increasing in cyber-attacks, the need for web servers attack detection technique has drawn attentions today. Unfortunately, many available security solutions are inefficient in identifying web-based attacks. The main aim of this study is to detect abnormal web navigations based on web usage profiles. In this paper, comparing scrolling behavior of a normal user with an attacker, and simu...
متن کاملData Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
متن کامل